Fusion of Speech and Face by Enhanced Modular Neural Network

نویسندگان

Rahul Kala

Harsh Vazirani

Anupam Shukla

Ritu Tiwari

چکیده

Biometric Identification is a very old field where we try to identify people by their biometric identities. The field shifted to bi-modal systems where more than one modality was used for the identification purposes. The bimodal systems face problem related to high dimensionality that may many times result in problems. The individual modules already have large dimensionality. Their fusion adds up the dimensionality resulting in still larger dimensionality. In this paper we solve these problems by the introduction of modularity at these attributes. Here we divide various attributes among various modules of the modular neural network. This limits their dimensionality without much loss in information. The integrator collects the probabilities of the occurrences of the various classes as outputs from these neural networks. The integrator averages these probabilities from the various modules to get the final probability of the occurrence of each class. This averaging is performed on the basis of the efficiencies of the modules at the time of training. A module that is well trained is hence expected to give a better performance than the one which is not well trained. In this manner the final probability vector may be calculated. Then the integrator selects the class that has the highest probability of occurrence. This class is returned as the output class. We tested this algorithm over the fusion of face and speech. The algorithm gave good recognition of 97.5%. This shows the efficiency of the algorithm.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

معرفی شبکه های عصبی پیمانه ای عمیق با ساختار فضایی-زمانی دوگانه جهت بهبود بازشناسی گفتار پیوسته فارسی

In this article, growable deep modular neural networks for continuous speech recognition are introduced. These networks can be grown to implement the spatio-temporal information of the frame sequences at their input layer as well as their labels at the output layer at the same time. The trained neural network with such double spatio-temporal association structure can learn the phonetic sequence...

متن کامل

Inverse modeling of gravity field data due to finite vertical cylinder using modular neural network and least-squares standard deviation method

In this paper, modular neural network (MNN) inversion has been applied for the parameters approximation of the gravity anomaly causative target. The trained neural network is used for estimating the amplitude coefficient and depths to the top and bottom of a finite vertical cylinder source. The results of the applied neural network method are compared with the results of the least-squares stand...

متن کامل

Face Detection with methods based on color by using Artificial Neural Network

The face Detection methodsis used in order to provide security. The mentioned methods problems are that it cannot be categorized because of the great differences and varieties in the face of individuals. In this paper, face Detection methods has been presented for overcoming upon these problems based on skin color datum. The researcher gathered a face database of 30 individuals consisting of ov...

متن کامل

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

متن کامل

شبکه عصبی پیچشی با پنجره‌های قابل تطبیق برای بازشناسی گفتار

Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2010

Fusion of Speech and Face by Enhanced Modular Neural Network

نویسندگان

چکیده

منابع مشابه

معرفی شبکه های عصبی پیمانه ای عمیق با ساختار فضایی-زمانی دوگانه جهت بهبود بازشناسی گفتار پیوسته فارسی

Inverse modeling of gravity field data due to finite vertical cylinder using modular neural network and least-squares standard deviation method

Face Detection with methods based on color by using Artificial Neural Network

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

شبکه عصبی پیچشی با پنجره‌های قابل تطبیق برای بازشناسی گفتار

عنوان ژورنال:

اشتراک گذاری